Search CORE

7 research outputs found

Prophet Attention: Predicting Attention with Future Attention for Image Captioning

Author: Fan Wei
Liu Fenglin
Ren Xuancheng
Sun Xu
Wu Xian
Zou Yuexian
Publication venue
Publication date: 11/04/2023
Field of study

Recently, attention based models have been used extensively in many sequence-to-sequence learning systems. Especially for image captioning, the attention based models are expected to ground correct image regions with proper generated words. However, for each time step in the decoding process, the attention based models usually use the hidden state of the current input to attend to the image regions. Under this setting, these attention models have a "deviated focus" problem that they calculate the attention weights based on previous words instead of the one to be generated, impairing the performance of both grounding and captioning. In this paper, we propose the Prophet Attention, similar to the form of self-supervision. In the training stage, this module utilizes the future information to calculate the "ideal" attention weights towards image regions. These calculated "ideal" weights are further used to regularize the "deviated" attention. In this manner, image regions are grounded with the correct words. The proposed Prophet Attention can be easily incorporated into existing image captioning models to improve their performance of both grounding and captioning. The experiments on the Flickr30k Entities and the MSCOCO datasets show that the proposed Prophet Attention consistently outperforms baselines in both automatic metrics and human evaluations. It is worth noticing that we set new state-of-the-arts on the two benchmark datasets and achieve the 1st place on the leaderboard of the online MSCOCO benchmark in terms of the default ranking score, i.e., CIDEr-c40.Comment: Accepted by NeurIPS 202

arXiv.org e-Print Archive

Recommended from our members

Protected Health Information filter (Philter): accurately and securely de-identifying free-text clinical notes.

Author: Butte Atul J
Fan Xuancheng
Glicksberg Benjamin S
Goldstein Theodore
Ludwig Dana
Muenzen Kathleen
Norgeot Beau
Oskotsky Boris
Peterson Thomas A
Rutenberg Eugenia
Schenk Gundolf
Schmajuk Gabriela
Sirota Marina
Yazdany Jinoos
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

There is a great and growing need to ascertain what exactly is the state of a patient, in terms of disease progression, actual care practices, pathology, adverse events, and much more, beyond the paucity of data available in structured medical record data. Ascertaining these harder-to-reach data elements is now critical for the accurate phenotyping of complex traits, detection of adverse outcomes, efficacy of off-label drug use, and longitudinal patient surveillance. Clinical notes often contain the most detailed and relevant digital information about individual patients, the nuances of their diseases, the treatment strategies selected by physicians, and the resulting outcomes. However, notes remain largely unused for research because they contain Protected Health Information (PHI), which is synonymous with individually identifying data. Previous clinical note de-identification approaches have been rigid and still too inaccurate to see any substantial real-world use, primarily because they have been trained with too small medical text corpora. To build a new de-identification tool, we created the largest manually annotated clinical note corpus for PHI and develop a customizable open-source de-identification software called Philter ("Protected Health Information filter"). Here we describe the design and evaluation of Philter, and show how it offers substantial real-world improvements over prior methods

eScholarship - University of California

Type-IV DCT, DST, and MDCT algorithms with reduced numbers of arithmetic operations

Author: Arai
Arguello
Britanak
Britanak
Britanak
Chan
Chan
Chen
Cheng
Chiang
Crochiere
Duhamel
Duhamel
Duhamel
Fan
Frigo
Gentleman
Gopinath
Hou
Jing
Johnson
Johnson
Kamar
Kok
Krot
Lee
Lee
Lee
Liu
Lundy
Malvar
Malvar
Malvar
Martens
Murthy
Narasimha
Nikolajevic
Painter
Pennebaker
Plonka
Princen
Püschel
Qian
Schatzman
Steven G. Johnson
Suehiro
Takala
Tasche
Vetterli
Wang
Wang
Wang
Xuancheng Shao
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

We present algorithms for the type-IV discrete cosine transform (DCT-IV) and discrete sine transform (DST-IV), as well as for the modified discrete cosine transform (MDCT) and its inverse, that achieve a lower count of real multiplications and additions than previously published algorithms, without sacrificing numerical accuracy. Asymptotically, the operation count is reduced from ~2NlogN to ~(17/9)NlogN for a power-of-two transform size N, and the exact count is strictly lowered for all N > 4. These results are derived by considering the DCT to be a special case of a DFT of length 8N, with certain symmetries, and then pruning redundant operations from a recent improved fast Fourier transform algorithm (based on a recursive rescaling of the conjugate-pair split radix algorithm). The improved algorithms for DST-IV and MDCT follow immediately from the improved count for the DCT-IV.Comment: 11 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

Qwen Technical Report

Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans. In this work, we introduce Qwen, the first installment of our large language model series. Qwen is a comprehensive language model series that encompasses distinct models with varying parameter counts. It includes Qwen, the base pretrained language models, and Qwen-Chat, the chat models finetuned with human alignment techniques. The base language models consistently demonstrate superior performance across a multitude of downstream tasks, and the chat models, particularly those trained using Reinforcement Learning from Human Feedback (RLHF), are highly competitive. The chat models possess advanced tool-use and planning capabilities for creating agent applications, showcasing impressive performance even when compared to bigger models on complex tasks like utilizing a code interpreter. Furthermore, we have developed coding-specialized models, Code-Qwen and Code-Qwen-Chat, as well as mathematics-focused models, Math-Qwen-Chat, which are built upon base language models. These models demonstrate significantly improved performance in comparison with open-source models, and slightly fall behind the proprietary models.Comment: 59 pages, 5 figure

arXiv.org e-Print Archive

Methods on COVID-19 epidemic curve estimation during emergency based on Baidu search engine and ILI traditional surveillance in Beijing, China

Author: Fan Guohui
Feng Luzhao
Han Xuan
Hu Xuancheng
Lai Shengjie
Li Zhongjie
Liu Zhimin
Qian Jie
Yang Liuyang
Yang Weizhong
Zhang Ting
Publication venue
Publication date: 06/09/2023
Field of study

Surveillance is an essential work on infectious diseases prevention and control. When the pandemic occurred, the inadequacy of traditional surveillance was exposed, but it also provided a valuable opportunity to explore new surveillance methods. This study aimed to estimate the transmission dynamics and epidemic curve of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Omicron BF.7 in Beijing under the emergent situation using Baidu index and influenza-like illness (ILI) surveillance. A novel hybrid model (multiattention bidirectional gated recurrent unit (MABG)–susceptible–exposed–infected–removed (SEIR)) was developed, which leveraged a deep learning algorithm (MABG) to scrutinize the past records of ILI occurrences and the Baidu index of diverse symptoms such as fever, pyrexia, cough, sore throat, anti-fever medicine, and runny nose. By considering the current Baidu index and the correlation between ILI cases and coronavirus disease 2019 (COVID-19) cases, a transmission dynamics model (SEIR) was formulated to estimate the transmission dynamics and epidemic curve of SARS-CoV-2. During the COVID-19 pandemic, when conventional surveillance measures have been suspended temporarily, cases of ILI can serve as a useful indicator for estimating the epidemiological trends of COVID-19. In the specific case of Beijing, it has been ascertained that cumulative infection attack rate surpass 80.25% (95% confidence interval (95% CI): 77.51%–82.99%) since December 17, 2022, with the apex of the outbreak projected to transpire on December 12. The culmination of existing patients is expected to occur three days subsequent to this peak. Effective reproduction number (Rt) represents the average number of secondary infections generated from a single infected individual at a specific point in time during an epidemic, remained below 1 since December 17, 2022. The traditional disease surveillance systems should be complemented with information from modern surveillance data such as online data sources with advanced technical support. Modern surveillance channels should be used primarily in emerging infectious and disease outbreaks. Syndrome surveillance on COVID-19 should be established to following on the epidemic, clinical severity, and medical source demand

Southampton (e-Prints Soton)

In-situ formation of ultrafine MgNi3B2 and TiB2 nanoparticles: Heterogeneous nucleating and grain coarsening retardant agents for magnesium borate in Li–Mg–B–H reactive hydride composite

Author: Bosenberg
Deprez
Deprez
Fan
Gianotti
Gosalawit-Utke
Gosalawit-Utke
Guo
He
Hu
Huang
Huang
Ismail
Jiahuan He
Jones
Jones
Kalubarme
Kang
Kelly
Larcher
Lee
Li
Li
Lixin Chen
Ma
Manfrinetti
Mao
Mustafa
Nielsen
Pinkerton
Plerdsranoy
Plerdsranoy
Puszkiel
Puszkiel
Qu
Schlapbach
Shao
Shao
Sharp
Thiangviriya
Utke
Vajo
Wang
Wang
Wang
Wang
Wang
Xia
Xiulin Fan
Xu Huang
Xuancheng Wang
Xuezhang Xiao
Yap
Zhang
Zhang
Zhao
Zhao
Zhendong Yao
Zhu
Zielinska
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Overexpressing fusion proteins of 4-coumaroyl-CoA ligase (4CL) and stilbene synthase (STS) in tobacco plants leading to resveratrol accumulation and improved stress tolerance

Author: AS Dubrovina
BG Ma
C Fan
C Nguyen
C Riviere
C Wang
E Titarenko
Feiyan Xue
G Thiel
H Guo
H Sebai
Huili Guo
I Lekli
IY Sakharov
J Chong
J Schroder
JD Hipskind
JD Lim
K Nandagopal
KJ Livak
KT Howitz
KT Watts
L Bulow
L Kursvietiene
Lanqing Ma
Lulu Zhang
M Chu
M Hasan
M Kato
M Li
M Theodotou
Mingfeng Yang
MR Bell
N Bostanghadiri
OA Aleynova
P Coutos-Thevenot
P Jeandet
R Hain
RB Horsch
RM Horton
RO Sinnhuber
S Cheng
S Hatmi
S Tantong
S Weiskirchen
S Zheng
SY Shin
VL Truong
Xuancheng He
Y Liu
Y Lu
Y Wang
Y Wang
Y Zhang
YJ Jeong
Z Luo
ZK Punja
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref